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Abstract: One of the primary forestry research interests lies in estimating forest stand parameters by applying empirical or semi-empirical 
model to establish the relationship between the forest stand parameters and remote sensing data. Using remote sensing image and the inven¬ 
tory data from 2 compartments in northeast Florida, U.S.A., this paper explored the correlation between forest stand parameters and Landsat 
TM spectral digital number (DN) value. Results showed that less than 50% of the total variance could be explained by linear regression 
models with only either a single band or such vegetation indices as vegetation index (VI) or normalized difference vegetation index (NDVI) 
as predicators. In consequence, multi-linear regression models which synthesized more predicators were introduced to estimate forest pa¬ 
rameters. Regression results were tested in terms of the other group of data, and verification showed a better capability of explaining over 
75% variance except for forest density. The weakness and further improvement of prediction models were also discussed in the article. This 
paper is expected to provide a better understanding of the relationship between TM spectral and forest characteristics 
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Introduction 

Recently remote sensing has been intensively used in forestry 
and revolutionized the methodology of traditional inventory. 
Forest stand parameters, such as age, volume or biomass, aver¬ 
age stem diameter and height, are important to assess forest re¬ 
sources (Mallinis et al. 2004). Traditional inventory of forest 
stand parameters based on fieldwork is often difficult, costly and 
time-consuming to conduct, especially in a large area (Lu et al. 
2004). TM image, as a widely used remote sensing data, has long 
time been used as an auxiliary data to obtain information on 
forest resources (Reese et al. 2002; Makela and Pekkarinen 
2004). These auxiliary data, together with field inventory data 
can thus be used for the estimation of forest characteristics. 

It is essential to understand how image data relate to forest 
stand characteristics (Lucas and Curran 1999; Visser and 
Stampfer 2003). Empirical models including correlation and 
regression are important tools for relating field-measured stand 
parameters to remote sensing data. The strength of them is that 
they describe the measured data using a specified mathematical 
function or curve (Peng et al. 2002). In addition, regression 
analysis is easier to link two sets of data for providing continu¬ 
ous estimation of stand variables (Khatry Chhetri and Fowler 
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1996; Zas et al. 2004). Hence, regression models are popularly 
applied due to their plainness and easy to be constructed and 
understood. 

The main objective of this study is to estimate forest parame¬ 
ters on the basis of TM data. Sub-objectives are to (1) study the 
correlation between individual band of TM data and forest pa¬ 
rameters selected, such as forest age, forest density, mean basal 
area per tree and mean diameter at breast height per tree; (2) 
explore the correlation between the most commonly used vegeta¬ 
tion indices (NDVI and VI) and forest parameters selected and (3) 
evaluate the relationship between different band combination and 
forest parameters by regression model and determine which band 
or band composition is the most suitable for corresponding stand 
parameters and why by stepwise regression. 

Materials and method 

Site description 

The study area is located in Duval County (30° 19' 24" N, 81° 40' 
29" W) of northeast Florida (Fig. 1). Physiographically, it is 
within Coastal Plain, which is characterized by a series of an¬ 
cient marine terraces oriented parallel to the present coastline. 
The area has relatively flat terrain and is less than 100 m in ele¬ 
vation. Sandy soil is predominant soil type in the study area and 
according to the climatological data it has mean annual precipita¬ 
tion of 1,352 mm and mean annual temperature of 20.4°C. Most 
of the study area is forested, primarily with two southern pine 
species, slash pine (Pinus elliottii) and longleaf pine (Pinus pal- 
ustris). Another major forest type is bottomland swamp forests 
occurring on areas of relatively poor drainage. Pond cypress 
(Taxodium distichum var. nutans), bald cypress (Taxodium disti- 
chum) and other broadleaf hardwoods are the dominant species 
occurring in these swamp forests. 
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Data collection 
Inventory Data 

Field data were used to construct relationships between the 
measured forest parameters and TM data. During the fieldwork, 
the forest stand boundary maps and Landsat TM color compos¬ 
ites were used to allocate the samples sites, and GPS devices 5 
were used to identify the coordinates of each site. 

In this study, 60 stands (30 m><30 m for each) were randomly 
selected within 2 different compartments, excluding the stands 
near or cross the boundary of the image data. 40 stands data were 
used for regression model construction and the other 20 stands 10 
were used for validation. The species of the stands were identi¬ 
fied as slash pine, longleaf pine, pond cypress and bald cypress, 
occupying 37%, 29%, 21% and 13%, respectively. The forest 
stand parameters selected for this study were average age (AGE, 
yr), forest density expressed by trees per square meter (FD, 15 
tree-m' 2 ), mean basal area per tree (MBA, m 2 -tree _1 ), mean di¬ 
ameter at breast height per tree (MDBH, m-tree' 1 ). 


Image data 

A cloud-free frame of Landsat-5 TM data, collected on Oct. 12, 
1984 (Fig. 1) was selected in this study. To ensure compatibility 
between image and the ground data, radiometric and atmospheric 
correction and geometrical rectification of remotely sensed data 
were performed before the research. An improved image-based 
dark object subtraction (DOS) model was used to correct atmos¬ 
pheric effect in the study. Geometric coordinate was transformed 
and registered from imagery to a standard local map projection to 
facilitate linkage with the ground data in the presence of a set of 
ground control points (GCPs) defining a mathematical transfor¬ 
mation equation. Nearest neighbor resampling was applied in 
geometrical transformation in order to minimize changes of the 
statistical properties of the data sets. In the images obtained from 
the TM scanner, each pixel was represented by a digital number 
DN from 0 to 255. Zonal statistics of GIS and correlation & re¬ 
gression of statistics were applied in this study. 



site 

two 




Fig. 1 Location of samples site and forest cover TM image (4 band in red, 5 in green and 2 in blue) of study area 


Results and discussion 

Correlation between forest parameters and TM image 

A good understanding of the relationships between forest stand 
parameters and remote sensing spectral was a prerequisite for 
effectively using appropriate image bands to estimate forest 
stand parameter. The average of TM image digital number (DN) 
values and four forest parameters selected in each of 40 stands 
were applied to conduct correlation analysis. Coefficients of 
Pearson’s correlation p and of determination R were calculated. 
Correlation with p-value less than 0.05 was assumed to be statis¬ 
tically significant. Moreover, R 2 was also used to predict the 
power of description of the regression model. The results were 
shown as table 1. 


Although most of bands statistically had a statistically signifi¬ 
cant linear relationship with almost all the parameters selected, it 
did not mean that the linear regression models established were 
good enough for parameters estimation. Since most of absolute r 
were less than 0.7, only less than 50% of the variance can be 
explained by the regression model. 

Correlation between forest parameters and vegetation index 

Relationship between vegetation indices and forest parameters 
were explored in this section. As it is well known, vegetation 
strongly reflects infrared electromagnetic wave and absorbs red 
one. Thus the difference in vegetation absorbing infrared and red 
can be considered as a crucial indicator and the important basis 
of various vegetation indices. Numerous indices were con¬ 
structed before. In this section, two most commonly applied 
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vegetation indices NDVIN (normalized vegetation index with a 
power of n) and VI (vegetation index) were used. NDIVN was 
calculated as (NIR n - Red 11 )/ (NIR n + Red 11 ) (n = 1, 2, 3). The 
results were shown as Table 1. 

Table 1 also showed that for VI or NDVIN, all R 2 were less 


than 50%, presumably half of the variance was able to be ex¬ 
plained by the linear regression models. Thus, vegetation index 
as predicator only, the same as individual band can not be made a 
convincible prediction model. 


Table 1 Correlation between TM band DN value and forest stand parameters (n=40) 


Pearson correlation 

TM1 

TM2 

TM3 

TM4 

TM5 

TM7 

VI 

NDVI1 

NDVI2 

NDVI3 


coefficient 

-0.378* 

-0.258 

-0.449* 

0.122 

-0.640* 

-0.593* 

0.583* 

0.564* 

0.543* 

0.508* 

AGE 

P 

0.006 

0.065 

0.001 

0.388 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 


R 2 

0.142 

0.067 

0.202 

0.015 

0.410 

0.352 

0.340 

0.318 

0.295 

0.258 


coefficient 

-0.283* 

-0.341* 

-0.276* 

-0.367* 

-0.205 

-0.167 

0.162 

0.184 

0.197 

0.214 

FD 

P 

0.042 

0.013 

0.047 

0.007 

0.144 

0.236 

0.250 

0.193 

0.161 

0.127 


R 2 

0.080 

0.116 

0.076 

0.135 

0.042 

0.028 

0.026 

0.034 

0.039 

0.046 


coefficient 

-0.463* 

-0.383* 

-0.542* 

0.09 

-0.738* 

-0.675* 

0.702* 

0.671* 

0.642* 

0.595* 

MDBH 

P 

0.001 

0.005 

0.000 

0.524 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 


R 2 

0.214 

0.147 

0.294 

0.008 

0.545 

0.456 

0.493 

0.450 

0.412 

0.354 


coefficient 

-0.406* 

-0.325* 

-0.485* 

0.121 

-0.675* 

-0.623* 

0.643* 

0.614* 

0.586* 

0.542* 

MBA 

P 

0.003 

0.019 

0.000 

0.392 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 


R 2 

0.165 

0.106 

0.235 

0.015 

0.456 

0.388 

0.413 

0.377 

0.343 

0.294 


Note: * means significance at 0.05 level (two-tailed t-test) 


Regression between forest parameters and TM image 

Neither single band nor vegetation index was capable of predict¬ 
ing forest parameters. In order to establish a model that can ex¬ 
plain more variance, more explanatory variables were required to 
be involved. Consequently, stepwise multi-linear regression 
method was applied in this study. Multiple stepwise regression 
analysis generated a linear equation that predicted a dependent 


variable as a function of several independent ones. Construction 
of the polynomial was performed by a best-fitting selection of 
the polynomial terms using the stepwise regression algorithm. 
The number of terms for satisfying a user-given accuracy of 
approximation was minimized. For each coefficient, the t-ratio 
tests whether the value of the coefficient was zero or not, and if 
its p-value was less than 0.05, the calculated value was consid¬ 
ered statistically significant. Table 2 illustrated the results. 


Table 2 Coefficients of stepwise multi-linear regression models for forest stand parameters 




Unstandardized 

Standardized 

Standardized 




Model 

coefficient 


sjc 


t 

P 

R 2 



Coefficients (3 

Coefficients (3 

Error 





constant 

-405.061 


105.115 

-3.853 

0.000 



TM2 

15.141 

2.564 

1.992 

7.600 

0.000 


AGE 

TM3 

-12.578 

-4.114 

1.590 

-7.913 

0.000 

0.738 


TM1 

4.405 

1.193 

2.164 

2.036 

0.047 



constant 

1583.657 


514.317 

3.079 

0.003 


FD 

TM4 

-22.720 

-0.367 

8.139 

-2.791 

0.007 

0.135 


constant 

-36.894 


7.163 

-5.151 

0.000 


MDBH 

TM5 

-0.260 

-1.699 

0.025 

-10.514 

0.000 

0.759 


TM1 

0.727 

1.067 

0.110 

6.604 

0.000 



constant 

-2.099 


0.443 

-4.738 

0.000 


MBA 

TM5 

-0.013 

-1.642 

0.002 

-8.730 

0.000 

0.673 


TM1 

0.039 

1.073 

0.007 

5.705 

0.000 



For the parameter of age, the best linear regression model was 
established using TM band 1, 2 and 3 as predicators. This model 
explained 73.8% of total variance (R 2 ) and band 3 was the most 
deterministic predicator due to the t-ratio value (-7.913). For FD, 
only one predicator band 4 was left after stepwise regression. 
Although the model was statistically significant, it could only 
explain 13.5% of total variance and was poor for prediction. For 


MDBH, the best linear regression model was established using 
TM band 1 and 5. This model explained 75.9% of total variance, 
of which band 5 had comparatively stronger influence according 
to the t-ratio value. For MBA, the best linear regression model 
was established using band 1 and 5. The multi-linear model ex¬ 
plained 67.3% of total variance and band 5, as it was of MDBH, 
had a relatively stronger influence. 
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By means of stepwise regression, the regression models estab¬ 
lished showed a stronger ability to explain more variance com¬ 
pared to that with simple band or vegetation indices. The regres¬ 
sion models listed provide the choices for predicting forest stand 
parameters. 

AGE = -405.061+ 15.141 *771/2-12.578 *TM3 +4.405 *TM1 

FD = 1583.657 - 22.720 * TM 4 

MDBH = -36.894 - 0.260 * TM5 + 0.727 * TMl 

MBA = -2.099 - 0.0133 * TM 5 + 0.0389 * TMl 

The other 20 stands data were used to verify the prediction re¬ 
sults. The results of verification showed the prediction precision 
of 79%, 34%, 81% and 75% for AGE, FD, MDBH and MBA 
respectively. 

Conclusion 

Forest stand characteristics can be described by many stand pa¬ 
rameters such as age, DBH, density and height, but these data 
collection are time and labor consuming by conventional forest 
inventory. Remote sensing is normally considered an alternative 
and assumed to be a better method for obtaining insight of forest 
characteristics. TM image data and linear regression model are 
commonly used to fulfill this task. 

The relationship between forest parameters selected and TM 
image DN values was explored by linear regression model in the 
article. It is concluded that the regression models with only sin¬ 
gle band or vegetation index as predicator are not reliable for 
applying prediction. Multi-linear regression model which syn¬ 
thesizes multi-band of TM as dependant variables is necessary to 
be introduced. The case study showed that except for FD, all 
other forest parameters can be explained and predicted by step¬ 
wise regression models, offering a better understanding of rela¬ 
tionships between bands of TM image and forest characteristics 

A weakness of this study is that the sample set was not equally 
weighted due to varied area of forest stand, which will more or 


less reduce the credibility of prediction models. Nevertheless, the 
general tendency of the interaction relationship between TM data 
and forest parameters is not changed. Moreover, the regression 
model for prediction could not be transferred to other areas be¬ 
cause the data were not calibrated to their true energy level. In 
order to obtain a transferable model, the calibration from DN 
value to reflectance should be performed firstly, which tremen¬ 
dously strengthens the flexibility of prediction model over the 
time and space. 
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